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The invention relates 
to a reception method to be 
used in a CDMA system 
and to a CDMA receiver, 
comprising an antenna 

(40) , radio frequency parts 

(41) , an A/D converter 

(42) , an adaptive linear 
prestage (43a to 43d) and 
an adaptive non-linear 
detector means (44), 
the detector means (44) 
detecting several users' 
signals simultaneously 
and being (44) responsive 
to a received signal for 
correcting parameters to 
be used for detection to 
correspond to signal states 
of the received signal. 
For an optimal detection 
of the received signal, an 
output signal of the linear 
prestage (43a to 43d) of 
the receiver according to 
the invention supervises 
setting the parameters for 

the adaptive non-linear detector means (44). 
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Reception method and CDMA receiver 



The invention relates to a reception method to 
be used in a CDMA system, in which signals of several 
users are detected simultaneously from a received sig- 
nal, an adaptive nonlinear decision rule is utilized for 
detection and parameters of a detector are supervised 
on the basis of the received signal to correspond better 
to signal states of the received signal. 

The CDMA is a multiuser system based on spread- 
spectrum technique, the application of which system to 
cellular radio systems has started recently, besides the 
previous FDMA and TDMA systems . The CDMA has several ad- 
vantages compared to the previous methods, such as sim- 
plicity of frequency planning and spectrum efficiency. 

In the CDMA method, a narrow-band data signal 
of a user is multiplied to a relatively broad band by 
a spreading code having a considerably broader band than 
the data signal. Bandwidths used in known test systems 
are for instance 1,25 MHz, 10 MHz and 25 MHz. In con- 
nection with the multiplication, the data signal spreads 
to the whole band to be used. All users transmit simul- 
taneously by using the same frequency band. Each connec- 
tion between a base station and a mobile station uses 
its own spreading code and the signals of the users can 
be separated from each other in receivers on the basis 
of each user's spreading code. The purpose is to select 
the spreading codes in such a way that they are mutually 
orthogonal, i.e. they do not correlate with each other. 

Correlators in a CDMA receiver implemented in 
a conventional manner are synchronized with a desired 
signal, which is detected on the basis of spreading 
code. The data signal is returned to the original band 
in the receiver by remultiplying it by the same spread- 
ing code as at the transmission stage, in an ideal case. 



the signals multiplied by some other spreading code do 
not correlate and do not return to the narrow band. Thus 
they appear as noise with respect to the desired signal. 
Accordingly, the aim is to detect a desired user's sig- 
nal from among several interfering signals. In practice, 
spreading codes are not decorrelatable and other users ' 
signals make the detection of the desired signal more 
difficult by distorting the received signal nonlinearly. 
This interference caused by the users to each other is 
called a multiuser interference. 

The single-user detection method described 
above is not optimal, because it ignores in connection 
with detection the information included in other users' 
signals. Additionally, conventional detection is not 
capable of correcting nonlinearities , which are partial- 
ly caused by nonorthogonal spreading codes and a distor- 
tion of a signal on a radio path. An optimum receiver 
considers the information included in the signals of all 
users so that the signals may be detected optimally by 
using the Viterbi algorithm, for instance. An advantage 
of this detection method is that bit error ratio curves 
of the receiver resemble a situation of the single-user 
CDMA system with no multiuser interferences occurring. 
No near-far problem exists, for instance. A near-far 
problem is called a situation when a transmitter close 
to a receiver covers with its transmission the trans- 
mitters located farther away. The most serious defi- 
ciency of the Viterbi algorithm is that the computation- 
al intensity required increases exponentially with an 
increasing number of users. For instance, a ten-user 
system having a bit rate of 100 kbit/s using QPSK modu- 
lation would require 105 millions of measures per second 
for a computation of the likelihood function. This pre- 
vents in practice an implementation of the optimum 
receiver. 



An optimum receiver can, however, be approxi- 
mated by different methods. As prior art are known dif- 
ferent kinds of methods for simultaneous multiuser de- 
tection. To the best known methods belong linear multi- 
user detector, decorrelating detector and multistage 
detector. These methods are described in more detail in 
the references Varanasi, Aazhang; Multistage detection 
for asynchronous code division multiple access communi- 
cations, IEEE Transactions on Communications, vol 38, 
pp. 509-519, Apr 1990, Lupas, Verdui Linear multiuser 
detectors for synchronous code-division multiple access 
channels, IEEE Transactions on Information Theory, vol 
35, no. 1, pp. 123-136, Jan 1989, and Lupas, Verdut 
Near-far resistance of multiuser detectors in asynchro- 
nous channels, IEEE Transactions on Communications, vol 
38, Apr 1990. Other known multiuser detection methods 
are disclosed in U.S. Patents 5353300 and 5343496 re- 
ferred to here. All these methods have, however, the 
drawback that they do hot track changes taking place on 
a radio channel. 

Accordingly, the present invention sets forth 
a novel manner of approximating an optimum receiver. The 
method in question is more resistant to interferences 
occurring both on transmission path and in transmitter. 
Traditional multiuser detection algorithms are fixed to 
a predetermined channel model, on the basis of which 
they have been designed. The method of the invention is 
not interested in a theoretical channel model, since the 
algorithm itself tends to model distortions occurred on 
the channel. The method is adapted to the prevailing 
situation, even if the origin of interferences were not 
known. For instance, a received signal may contain 
transmissions the spreading code of which is not known 
by the receiver. These may be e.g. transmissions moni- 
tored from the region of a neighbouring cell. The adap- 



tation of the method is faster than that of the previous 
neural network applications. 

This is achieved by means of a reception method 
of the type set forth in the preamble, which method is 
characterized in that the received signal is processed 
before detection at an adaptive linear prestage, which 
supervises setting the parameters for the adaptive non- 
linear detector. 

The invention relates further to a CDMA re- 
ceiver, comprising an antenna, radio frequency parts, 
an A/D converter, an adaptive linear prestage and an 
adaptive nonlinear detector means, the detector means 
detecting several users' signals simultaneously and 
being responsive to a received signal for correcting 
parameters to be used for detection to correspond to 
signal states of the received signal. The CDMA receiver 
according to the invention is characterized in that an 
output signal of the linear prestage supervises setting 
the parameters for the adaptive nonlinear detector 
means . 

By means of the method of the invention, an 
optimum receiver can be approximated with a desired 
accuracy. The receiver according to the method adapts 
quickly and accurately to randomly time-varying propaga- 
tion conditions on a radio path, which conditions dis- 
tort a received signal. In such systems, the detector 
according to the invention adapts well by means of a 
very little amount of learning information. By combining 
several learning algorithms in such a way that the most 
suitable method for each situation is used, a very short 
learning time can be achieved. As to traditional neural 
network applications, a realization thereof has been 
prevented in practice by the length of the learning 
time.. 
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Accordingly, in a preferred embodiment of the 
invention, the adaptive detector is realized by means 
of a neural network, such as an adaptive signal point 
system, in which each point of the signal point system 
corresponds to one possible combination of signals 
transmitted by several users. The points of the adaptive 
signal point system are positioned on right locations 
e.g. by means of a specific training period included in 
a received signal, in the preferred embodiment, the 
adaptive linear prestage supervising the detector per- 
forms an estimation of channel parameters. According to 
a second preferred embodiment, the points of the adapt- 
ive signal point system are counted in an unsupervised 
manner by means of a self-organizing map, for instance. 
According to a third preferred embodiment of the inven- 
tion, both above-mentioned initialization methods of the 
signal point system can be used in optional order and 
alternately, if necessary. Further, decision feedback 
methods can be utilized for supervising the neural net- 
work. 

In the following, the invention will be de- 
scribed in greater detail with reference to the examples 
according to the attached drawings, in which 

Figures la and lb illustrate the form of a re- 
ceived signal at matched filter outputs, 

Figure 2 illustrates an example of points indi- 
cated by code vectors, 

Figure 3 shows an example of computing the 
nearest code vector, 

Figure 4 shows the structure of a receiver 
according to the invention, 

Figures 5a and 5b illustrate the similarity of 
a traditional signal decision function and a hyperbolic 
tangent function and 



Figures 6a and 6b illustrate an example of a 
one-layer and two-layer neural network. 

Signals modulating in digital data transmission 
obtain only discrete values, such as ±A e , ±3A C , at sam- 
pling moments. Accordingly, these discrete values shall 
be identified in a receiver from an often distorted sig- 
nal having crossed a radio path. Figure 1 shows an ideal 
undistorted signal pattern of two users, i.e. a point 
density function of received signals, where the peaks 
of the function are situated at crossed points. Each 
point of a two-dimensional pattern signifies one possi- 
ble received signal value, which depends on the values 
of the signals transmitted by the users. For instance, 
point Al could signify a situation (1,1), meaning that 
a first user has transmitted the value 1 and a second 
user the value 1. Correspondingly, point A2 could sig- 
nify a situation (-1,1), meaning that the first user has 
transmitted the value -1 and the second user the value 
1. Point A3 could signify a situation (-1,-1) and point 
A4 a situation (1,-1). if there were three users, the 
pattern would be three-dimensional, and the dimension 
of the pattern grows with an increasing number of users, 
respectively. 

Figure 2 illustrates a distortion of a signal 
pattern caused by nonorthogonal codes and occurred on 
the radio path of a receiver at the output of spreading- 
code-matched filters. The peaks of the point density 
function have spread and moved due to the distortion. 
The received signal points have moved from their ideal 
locations, and the task of the receiver is to interpret 
the received signals to belong to some of the predeter- 
mined signal points. 

If decisions were made fully linearly, plenty 
of faulty decisions would appear on account of the dis- 
torted point system, as is seen from Figure 2. By means 



of the method according to the invention, it is possible 
to realize for instance piecewise linear decision bound- 
aries, by which the optimal nonlinear detection can be 
approximated with a desired accuracy. 

Suppose that the system has K users, i.e. CDMA 
transmitters, each of them having a specific spreading 
code of its own diff erring from the others 



s 



r * (t)- |>* w iL. e (t -w 

where the jth chip of the *th user's spreading code is 
marked, k = 1,2..K. T c is the length of the chip. The 
waveform of the *th user is restricted within [0,TJ. 
Each user transmits in the same frequency band data sym- 
bols e A modulated by the specific spreading code of its 
own, where A is the used symbol alphabet. The task of 
the receiver is then to demodulate a signal, which is, 
by using e.g. BPSK modulation method, of the form 

rU) ti hkit) S * ( t - ir " T * ) +n t> ten 

where 2P+1 is the number of symbols to be transmitted, 
n t is a noise term, T is the duration of the symbol and 
S {-1,1} signifies the kth user's information bit 
in the ith time slot, x, £[0,T] signifies the *th user's 
time deviation and h k (t) an impulse response of the *th 
user's physical channel. For the sake of clarity, it is 
assumed below that x^O, Vke {1,..K>, which means that 
the system is synchronous. However, the invention can 
be applied also to an asynchronous system in a corre- 
sponding manner. 
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Let us suppose further that the impulse re- 
sponse of a multipath channel is of the form 

^(t)^h ktl U)b(t-x ktl U)) 

where the kth user's 1th complex channel tap is marked 
h kfl E C and they are assumed to be either constants or 
fading as a function of time. 

At multiuser detection, decisions on received 
signals are made simultaneously for all K users. In this 
example, it is supposed that the channel has Gaussian 
noise and the bits transmitted by all K users simulta- 
neously at a predetermined moment are marked in a vector 
form b e {-1,1 } K . it is known that a maximum likelihood 
decision to be made in the receiver is based on a loga- 
rithmic likelihood function 

b <0t a **ff be * max {ly'b - b' Kb ) 

where B is a matrix of crosscorrelations between the 
used spreading codes, i.e. (B) k) ■ <S 1 ,S j > , i,j ■ 
1,2,...,K and the vector y comprises the matched filter 
outputs of the receiver. The above equation can be 
solved by means of a Viterbi type algorithm, but compu- 
tational complexity prevents an implementation of an 
optimum receiver of this type in practice, as has been 
told earlier. 

Accordingly, a signal received by the receiver 
has the above form r(t). The signal can be processed 
e.g. by using filtering at chip frequency or filtering 
at symbol frequency. The first manner can be described 
by the formula 

rlj) = j x(t)dt 
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assuming that one sample per chip is taken. On the other 
hand, the latter manner is illustrated by the formula 

m 

4fi 9 j rU)s k (t-v kl -iT b )dt 



It will be explained below how the above output 
signals of filters can be described for certain neural 
network structures and how a neural network of the 
method according to the invention is supervised by means 
of learning algorithms and which advantages are achieved 
by the method of the invention. 

Various decision rules differ from each other, 
as far as the used metric and searching algorithm are 
concerned, by which algorithm the state describing the 
signal best is searched for. Different metrics and dis- 
tance measures are described in the reference Teuvo 
Kohonent Self -Organization and Associative Memory, 
Springer-Verlag, Berlin-Heidelberg-New York-Tokio, 3rd 
edition, 1989, and as an example of those is given here 
a measure based on projection or filtering, where the 
vector m,,^ modelling the signal state is the vector to 
which the received signal has the largest projection: 

Another alternative is e.g. a so-called Mahalanobis 
distance or a weighted Euclidean metric, where 

flippt 3 arg min 1%-*^ 

= azsr min ^ (Hj -x) T $ (m ± -x) 



10 



where <j> is a weighting coefficient given for the dis- 
tance and depending on the correlations of the codes. 
Various decision rules and learning algorithms relating 
to them are used in the present invention in order to 
achieve the best result. It shall be mentioned that an 
optimum decision can be realized by the weighted Euclid- 
ean metric, but the complexity of the decision is very 
high. Referring to the above formulas, it is found that 
the vector x of this embodiment is e.g. of the form 

xm [r[j-J], r[j-J+l], r[j+J]] r , 
where the different elements represent chip-matched 
filter outputs influencing the jth decision. Correspond- 
ingly, the form 

can be used, where the elements are obtained from code- 
matched filter outputs. 

Firstly, vector quantization neural networks 
are discussed, and subsequently, feed forward neural 
networks . 

Vector Quantization Neural Network (VQNN) 
methods are often called generally by the term neural 
networks, since the used learning algorithms are con- 
sidered to be neural. 

The VQNN method utilizes a detection method 
based on an adaptive multidimensional signal point sys- 
tem. Received discrete signals are compared with the 
signal point system of the receiver and the received 
signal value is classified to belong to that point of 
the signal point system which is located at the shortest 
calculated distance. In a first preferred embodiment of 
the invention, the adaptive signal point system is cor- 
rected by means of a training period included in the 
received signal. In this manner, the receiver is capable 



of adapting to a distortion of the received signal by 
distorting the signal point system, correspondingly. 

The method described is called Learning Vector 
Quantization (LVQ) and it has been applied earlier in 
connection with pattern recognition problems. The method 
is described in more detail in the reference already 
mentioned Teuvo Kohonen: Self -Organization and Associa- 
tive Memory, Springer-Verlag, Berlin-Heidelberg-New 
York-Tokio, 3rd edition, 1989. 

In another preferred learning algorithm, the 
receiver corrects the adaptive signal point system by 
means of a Self-Organizing Map (SOM). No separate train- 
xng period is then needed. This self-organizing map 
method has been applied earlier in connection with pat- 
tern recognition problems in the same way as the LVQ and 
it has been described in greater detail both in the ref- 
erence mentioned above and in the references Teuvo Koho- 
nen: The Self-Organizing Map, Proceedings of The IEEE, 
78(9): pp. 1464-1480, 1990, Kohonen: Generalizations of 
the Self-organizing Map, Proc. of the International 
Joint Conference on Neural Networks, IJCNN'93, Nagoya, 
Japan, October 25-29, 1993, Kohonen: Things You Haven't 
Heard about the Self-Organizing Map, Proceedings of the 
1993 IEEE int. Conf. Neural Networks, San Francisco, 
USA, March 28- April 1, 1993, pp. 1147-1156, and Koho- 
nen, Raivio, Simula, Henriksson: Start-up Behaviour of 
a Neural Network Assisted Decision Feedback Equalizer 
xn a Two-Path Channel, Proc. of IEEE Int. Conf. on Com- 
munications, Chicago, USA, June 14-18, 1992, pp. 1523- 
1 52 7 • 

In the VQNN methods, the detection thus util- 
izes a set of classification points and a received sig- 
nal is classified to the point which is considered to 
be nearest. The method or the decision rule by which the 
nearest classification point is determined can be 
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changed during detection. In a CDMA application, a set 
of nearest classification points can be calculated by 
projection, for instance, and the final decision by 
means of a weighted (Euclidean) metric. Such a multi- 
phase solution is computationally efficient in CDMA 
applications, where the number of possible classes is 
large . 

If the channel parameters and the used spread- 
ing codes are known by the receiver and the receiver 
uses code-matched filters, code vectors can be deter- 
mined a priori by the formula 

M » RWB, 

where W is a diagonal matrix consisting of complex tap 
coefficients multiplied by signal energies, B is a 
matrix of bit combinations (a preferred representation 
includes K linearly independent bit vectors) and R Lb 
a matrix of crosscorrelations. Accordingly, if the 
delays of the codes are known and there is no need for 
code tracking, the representation is very simple. If the 
code delays are not known, as is the case in a cellular 
radio system, a receiver prestage can be used, consist- 
ing of a number of N-dimensional matched filters, which 
span a signal space, but are not necessarily matched to 
the spreading codes or to the above-mentioned chip- 
matched filtering. 

Accordingly, an initialization of a VQNN net- 
work can be performed in the method of the invention by 
means of an adaptive prestage calculating the channel 
parameters, on the basis of which are obtained good 
initial values for the network before the actual train- 
ing starts . Training can thus be speeded up considerably 
compared to the previous methods. 
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The LVQ and SOM methods will be described below 
by way of example from the point of view of the method 
according to the present invention. 

An optimum CDMA receiver simultaneously used 
by several users functions nonlinearly, responsive to 
sufficient statistics given by spreading-code-matched 
filters, in this context, the LVQ and SOM methods can 
be used for estimating optimal Bayesian decision bound- 
aries. The Bayesian decision boundaries separate the 
classes from each other in such a way that as few errors 
as possible occur. 

Each possible discrete signal state can be con- 
sidered to constitute its own class c^. Each class is 
determined by a number of code vectors, the dimension 
of which can be determined depending on application, in 
a synchronous CDMA, the dimension of the code vectors 
can be for instance the same as the number of users. In 
an asynchronous CDMA, the dimension is optimally K(P+1), 
but the number of computations will then grow high. As 
per situation, a suboptimum information arrived during 
symbol time [0,TJ can also be accepted, the dimension 
of the code vector being then e.g. 2(K-1). 

The number of code vectors per class depends 
on the approximation accuracy desired, if there is only 
one code vector in each class, the decision boundaries 
are linear. The more code vectors have been set, the 
more accurately the decision making approximates the 
optimum receiver, the decision boundaries being piece- 
wise linear and the complexity increasing with an in- 
creasing number of code vectors. Each class can also 
contain a different number of code vectors. Each code 
vector indicates some point representing the class. 
After a preliminary number of code vectors has been set 
for each class, the system adjusts the code vectors to 
indicate some point preliminarily. Channel parameters 
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can be utilized for selecting this preliminary point. 
During signal detection, the system adjusts the code 
vectors to indicate the optimal point at each moment. 

Assume that a certain number I of code vectors 
has been set for the system. All users' discrete signal 
samples to be received from matched filter outputs are 
marked by a vector y at each moment. Code vectors are 
marked m if i»l,...I. The code vector m c which is closest 
to the signal sample y is obtained e.g. by calculating 
an Euclidean distance 

c = arg min {ly-znj} . 

The above manner of distance calculation is only one 
possible method for determining a distance. Other man- 
ners have been set forth in the references cited above. 
In the method according to the invention , the manner of 
calculating the nearest code vector can be changed dur- 
ing symbol detection , as has been explained previously 
already. The distance metric or decision rule to be used 
may be changed e.g. on the basis of received signal, 
channel properties or receiver stage. 

Code vectors m A are now corrected on the basis 
of a received signal sample according to the following 
formulas, for instance: 



m e (t+l) = m e (t)Ht e (t)(y(t)-m,(t)), if y,^ e <o ± 
m e (t+l) = m e (t)-a e (t)(y(t)-m € (t)) i if y t o>„ 
m c z to jf i*j 

m x (t+l) = m^t) , otherwise 
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Accordingly, the uppermost equation in the above group 
of equations deals with a case when a signal sample has 
been classified right, the second equation a situation 
when a signal sample has been classified wrong. Other 
correction equations are presented in the references 
cited above. Individual learning coefficients a can be 
determined for instance as follows: 

H-s(t)« c (t-l) 



where s(t) - 1 for the right classification and -1 for 
the wrong classification. The points of the signal point 
system thus adapt according to the received signal and 
the decision making accuracy is maintained, though the 
signal is distorted and the distortion varies as a func- 
tion of time. 

A signal received according to the LVQ contains 
a learning period, according to which the neighbourhood 
can be adjusted. 

In the two-dimensional example of Figure la, 
each signal point V.A, can be considered to constitute 
its own class o^. For instance five code vectors can be 
selected to represent each class A 1 ..A 4 , figure 2 show- 
ing the points indicated by the vectors. Figure 3 illu- 
strates a calculation of the nearest code vector. A re- 
ceived signal is a vector 20, and distances between the 
code vectors and the vector 20 are calculated according 
to the method. Distance vectors 23 and 24 between the 
received vector 20 and the code vectors 21 and 22 are 
drawn in Figure 3 as an example. The shortest of them 
is selected, which is 24 in the case of the figure. In 
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this way the system classifies that the vector 20 be- 
longs to the class represented by the code vector 22. 

In the method based on a self-organizing map, 
the learning process, i.e. the correction of code vec- 
tors, differs from the LVQ therein that a received sig- 
nal does not include any specific learning period, but 
the self-organizing map groups the code vectors directly 
on the basis of the received signal to a location where 
the number of received signal points is higher. Conse- 
quently, it adapts automatically. The signal points are 
not divided into classes either, as in the LVQ. A topo- 
logical neighbourhood N c of a point indicated by a code 
vector m,; is constituted by surrounding neighbouring 
points at a desired depth. A correction of the location 
of the neighbourhood of the received code vector can now 
be performed for instance on the basis of the following 
equations : 

%(t+l)=in 4 (t)+a(y(t)-in i (t)), i=c 

m ± ( t+i) mm i ( t ) +0 (y ( t) -j^ ( t) ) , ieN c , i+c 

Other correction equations have been set forth in the 
references specified above. 

Accordingly, the points of the adaptive signal 
point system can be corrected to their right locations 
either by means of the LVQ or the SOM. It is also possi- 
ble to use both above methods alternately. For instance, 
the signal points can be accumulated on their right lo- 
cations by utilizing a self-organizing map, and subse- 
quently, the class points for the LVQ are determined by 
means of training vectors. After this, the receiver may 
use the SOM for keeping the signal points on their cor- 
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rect locations. Correspondingly, if there exists a pre- 
liminary estimate for the class points, the LVQ method 
can be used in the beginning, and in case of a change 
of channel, for instance, the data can be accumulated 
again by the SOM method. If necessary, the code vectors 
are classified by means of a training set. 

A drawback of a signal point system of above 
type is that the size of the signal point system can be 
very large, whereby searching for the optimal signal 
point system is complicated. However, the searching can 
be concentrated only on a small subset, as follows: 
1) A suboptimal decision is initially made on part of 
the dimensions of a vector x. The suboptimal decision 
can be realized by means of a filter bank or a decorre- 
lator, for instance. 2) A complete searching for the 
vector x is performed in such a way that part of the 
dimensions are fixed at step 1). At multiuser detection, 
strong signals and the dimensions of the vector x corre- 
sponding to those signals can preferably be detected 
suboptimally and an optimal searching based on Euclidean 
metric can be performed for the weakest users only. 

Subsequently, an alternative manner of realiz- 
ing an adaptive nonlinear decision is described. An es- 
sential difference with respect to the above is that 
here the detection is based on nonlinear filtering and 
the training is based on minimizing a mean square error 
by a nonlinear algorithm. Nonlinear detector training 
can here be initialized by means of a linear adaptive 
prestage in such a way that the training is speeded up 
essentially. Based on LMS or MMSE criteria, the adaptive 
linear prestage can, for instance, estimate receiver 
filters, from which can be calculated a correlation 
matrix, which is, in turn, utilized for nonlinear detec- 
tion. 
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A neural network structure of another type is 
discussed next, to which the reception method according 
to the invention can be applied. Peed forward neural 
network refers to a non-cyclic network, in which an in- 
put vector x lB e R d is mapped to an output vector 
x out e R * according to certain weighting coefficients W AJ 
and possible nonlinearities 6^. Parameters d, 1 and q 
define the dimensions of input, hidden and output layer. 

It is previously known that a cell of a neural 
network calculates the output value by means of the 
formula 

*i 88 fQp WyXj) + bias 

where f is some continuously differentiate nonlinear 
function, such as 

fix) = tanh(x) « e*-e'* 

e* + e~* 

or 

fix) « — I— 
l + e-x 

The hyperbolic tangent f(x) « tanh(x) is a particularly 
suitable nonlinear element, since it is close to a tra- 
ditional detector based on signal decision. This is il- 
lustrated in Figures 5a and 5b, Figure 5a showing said 
function y * sgn(x) making a traditional signal decision 
and Figure 5b showing the function y ■ tanh(x). 

The above-mentioned formula of the output value 
can be written in the form 

x t = f(wlx) + bias 

This corresponds to a conventional matched filter pre- 
stage, for which wf s s k * The above formulas are 

intended for real input signals and they can also be ap- 
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plied in a complex apace, which is typically used in 
telecommunication applications in such a way that the 
number of in- put connections is doubled (separate con- 
nections for the real and imaginary parts of each user) . 
Another alternative is to use complex neurons and a 
corresponding learning rule, as explained in the refer- 
ence N. Benvenuto, M. Marchesi, F. Piazza, A. Uncinii 
A comparison between real and complexvalued neural net- 
works in communication applications, Proc. Int. Conf. 
Artificial Neural Networks, Espoo, Finland, June 1991, 
pp. 1177-1180. 

A multilayer neural network is assembled by 
connecting neuron outputs of layer i to inputs of layer 
Figure 6a illustrates one neuron and Figure 6b a 
two-layer network. 

In case of a two-layer network, a mapping of 
an input vector takes place according to the following 
equations t 

out, * fineti) 



net 2 



w 2 [outj 



y * f(net 3 ) 

where w,, w 3 and f are determined as above, net, and out, 
map a calculation of a first layer, net, and y a calcu- 
lation of a second layer, when y is the output value of 
the network. When the input values are in matrix form, 
the network maps the input data according to the follow- 
ing formulas: 
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[ 1 * W 1 ] 



Y » f(N,), 

where f is a suitable nonlinear function and W 2 and W 2 
are weight matrices of the layers. 

The mapping of a neural network is marked by 
g() as follows: 

g t R 1 x J?-** 0 

y - g(x,w) 

where i, p and o signify dimensions of input, hidden and 
output layer and the vector w comprises the components 
of the matrices tr t and ft, mentioned previously. Neural 
network training comprises setting the vector w (or the 
matrices w i and W 3 , respectively) in such a way that the 
mapping performed by the network corresponds to the de- 
sired mapping as accurately as possible. In a telecommu- 
nication application, the aim is thus to train the net- 
work in such a way that the received signal can be de- 
tected as faultlessly as possible by means of the map- 
ping. The correctness of the mapping can be measured 
typically by a mean square error: 
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where (x lf tjji-l, . . .N is a set of training and target 
vector pairs. Target vectors are known values, by means 
of which the network can be trained. With large dimen- 
sions, a calculation according to the above formula 
requires a high capacity. 

For feed forward network training, a plurality 
of methods have been developed, such as e.g. backpropa- 
gation training utilizing a gradient method for minimiz- 
ing the error function. In the gradient method, a gradi- 
ent of the error function with respect to the weight 
function w is calculated, on the basis of which gradient 
the weight function is updated: 

* w OJ(f + TV E(w), 
where T determines the size of correction step. 

For the present invention, it is not essential 
as such which training method is used after the initial 
values have been set. other possible training methods 
are disclosed in the references R. Battiti: Accelerated 
backpropagation learning: Two optimization methods, 
Complex Systems 3, pp. 331-342, 1989, Poggio, Girosi: 
A theory of networks for approximation and learning, MIT 
memo not 1140, 1989 and J.F. Shepanski: Fast learning 
in Artificial Neural Systems: Multilayer perception 
training using optimal estimation, ICNN, 1988. 

Neural network training is typically started 
by setting random values for weight coefficients, which 
values will then be trained towards the correct values. 
The method of the invention utilizes an adaptive pre- 
stage giving information, e.g. channel parameters, by 
which information initial values corresponding better 
to the real situation can be set for the weight coeffi- 
cients, and in this manner, the actual training can be 
speeded up considerably. This corresponds to the situ- 
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ation described earlier in connection with VQNN networks 
when initial values were set for the code vectors. 

By using the previously presented two-layer 
network as an example, it is also possible to give the 
values of the weight coefficient matrix W 1 randomly and 
to calculate the matrix W 3 in the manner mentioned in 
the above Shepanski reference, for instance. When known 
input values X, to which correspond output values Y, and 
known target values T are used, a minimization is neces- 
sary 

|r- H|» |r- flw a o x )i z r 
or alternatively 

The above equations do not necessarily lead to the same 
final result. The solution of the latter equation is of 
the form 

where q* is a pseudoinverse matrix of o,. This approach 

does not give the optimal weight matrices, but it pro- 
duces the fastest learning in general. It is also possi- 
ble to apply linear regression techniques to every gra- 
dient iteration when the value of the matrix W x is up- 
dated and to update the W, by means of linear 
regression. 

One way of initializing a network is to utilize 
a conventional decorrelating detector for calculating 
the weighting coefficients of the first network layer. 
The weighting coefficients of the second layer can be 
calculated by the backpropagation method or by the She- 
pansky method. Further, it is possible to update the 
weighting coefficients obtained by means of the decorre- 
lating detector of the first layer by the backpropaga- 
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tion method. The known training methods of the network 
have already been described previously. 

In the reception method according to the inven- 
tion, the learning can be speeded up considerably by 
using known learning methods in a novel manner, which 
is especially suitable for a telecommunication applica- 
tion. A network training is performed assuming initially 
that the network does not contain nonlinearities in the 
hidden layer. Once sufficiently good initial values have 
been obtained, the nonlinear elements of the hidden 
layer are taken into consideration and the following 
network layers are trained for instance recursively, in 
the method of the invention, a training period can be 
used in each phase, though a specific advantage of the 
invention consists in that long training periods are not 
needed. 

If a set of matched filters is used as a detec- 
tor prestage, initial weighting coefficient estimates 
can be mapped to the next layers in a CDMA application 
directly by means of the formula W, = w 0 - j, where I 
refers to an identity matrix having the same dimension 
as W. After the initialization, it may be proceeded to 
decision feedback training. 

Figure 4 illustrates the structure of a CDMA 
receiver according to the invention, the receiver being 
in this example a base station receiver. However, the 
invention is suitable for being used also in a mobile 
station, respectively. The receiver comprises an antenna 
40, by which a received signal is brought via radio fre- 
quency parts 41 to an A/D converter 42. The converted 
signal is brought to means 43a to 43d preprocessing the 
received signal. In a preferred embodiment of the inven- 
tion, the means perform an estimation of channel parame- 
ters. The means may be realized e.g. by RAKE receivers, 
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each of them receiving a signal transmitted by one user. 
The receiver additionally comprises a control unit 45 
controlling the operation of the device. Each RAKE re- 
ceiver includes several separate correlators, each of 
them capable of receiving one multipath propagated sig- 
nal component. These received signal components are 
preferably combined in a RAKE receiver. The structure 
of the RAKE receiver has been described in more detail 
in the reference G. Cooper, C.McGillem: Modern Communi- 
cations And Spread Spectrum, McGraw-Hill, New York 1986, 
Section 12. 

Consequently, each RAKE receiver 43a to 43d 
receives one user's signal (and its multipath propagated 
components). Prom each RAKE receiver, the signal is 
brought to an adaptive detector 44 detecting the re- 
ceived multiuser signals simultaneously by utilizing an 
adaptive nonlinear decision rule and the above-described 
initialization and training methods according to the 
invention. 

Accordingly, the receiver of the invention can 
be realized also without RAKE receivers. The efficiency 
of the LVQ and SOM methods is sufficient as such in a 
multipath case, if the dimension of the code vectors is 
increased to correspond to the spreading caused by an 
impulse response. This concerns also a feed forward 
neural network. 

Also some other linear or nonlinear conversion 
of a received signal may be preformed in the preproces- 
sing means, such as multiplication by decorrelating 
matrix, which leads to a decision statistics of the 
decorrelating detector. 

Though the invention has been described above 
referring to the examples of the attached drawings, it 
is obvious that the invention is not restricted to them, 
but it can be modified in many ways within the scope of 
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the inventive idea set forth in the attached claims. For 
instance, neural networks of different types can be 
cascade connected in a desired manner so that a training 
as effxcxent as possible and a decision rule as simple 
and good as possible can be provided. 
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Claims: 

1. A reception method to be used in a CDMA sys- 
tem, in which signals of several users are detected 
simultaneously from a received signal , an adaptive non- 
linear decision rule is utilized for detection and para- 
meters of a detector are supervised on the basis of the 
received signal to correspond better to signal states 
of the received signal, characterized in 
that the received signal is processed before detection 
at an adaptive linear prestage, which supervises setting 
the parameters for the adaptive non-linear detector. 

2. A method according to claim 1, char- 
acterized in that the adaptive linear prestage 
performs an estimation of channel parameters . 

3. A method according to claim 2, char- 
acterized in that predecisions are calculated 
for the nonlinear detector on the basis of the channel 
parameters . 

4. A method according to claim 1, char- 
acterized in that the decision rule to be used 
in the adaptive nonlinear detector and a training algo- 
rithm suitable therefor are changed on the basis of the 
received signal and the stage of the receiver. 

5. A method according to claim 1, char- 
acterized in that the decision rule to be used 
in the adaptive nonlinear detector and the training 
algorithm suitable therefor are initialized on the basis 
of the channel parameters. 

6. A method according to claim 1, char- 
acterized in that the parameters of the adap- 
tive nonlinear detector are supervised on the principle 
of learning vector quantization. 

7. A method according to claim 1, char- 
acterized in that the parameters of the adap- 
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tive nonlinear detector are supervised by a self-orga- 
nizing map. 

8. A method according to claim 1, char- 
acterized in that the parameters of the adap- 
tive nonlinear detector are supervised by means of deci- 
sion feedback. 

9. A method according to claim 1, char- 
acterized in that the adaptive nonlinear detec- 
tor is realized by means of a neural network. 

10. A method according to claim 9, char- 
acterized in that the output of a nonlinear 
neuron depends on the distance between a signal point 
modelling ,the neuron and a signal arriving at the neuron 
input. 

11. A method according to claim 1, char- 
acterized in that the adaptive nonlinear detec- 
tor is realized by means of an adaptive multidimensional 
signal point system. 

12. A method according to claim 11, char- 
acterized in that each point of the adaptive 
signal point system represents one possible combination 
of signals transmitted by several users. 

13. A method according to claim 12, char- 
acterized in that a received symbol is classi- 
fied to a signal point which is closest to the received 
symbol . 

14. A method according to claim 13, char- 
acterized in that the nearest signal point is 
searched for in at least two steps in such a way that 
in the first step is used a suboptimal linear decision 
rule, by which the possible signal points are reduced, 
and that the signal point corresponding best to the re- 
ceived symbol is searched for in the reduced signal 
point system by means of an optimal distance measure. 
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15. A CDMA receiver, comprising an antenna 
(40), radio frequency parts (41), an A/D converter (42), 
an adaptive linear prestage (43a to 43d) and an adaptive 
non-linear detector means (44), the detector means (44) 
detecting several users' signals simultaneously and 
being (44) responsive to a received signal for correct- 
ing parameters to be used for detection to correspond 
to signal states of the received signal, charac- 
terized in that an output signal of the linear 
prestage (43a to 43d) supervises setting the parameters 
for the adaptive nonlinear detector means (44). 

16. A receiver according to claim 15, 
characterized in that the adaptive linear 
prestage (43a to 43d) performs estimation of channel 
parameters . 

17. A receiver according to claim 15, 
characterized in that the receiver com- 
prises means (44) for supervising parameters of a deci- 
sion rule to be used for detection by means of a self- 
organizing map. 

18. A receiver according to claim 15, 
characterized in that the receiver com- 
prises means (44) for supervising the parameters of the 
decision rule to be used for detection on the principle 
of learning vector quantization. 

19. A receiver according to claim 15, 
characterized in that the receiver com- 
prises means (44) for supervising the parameters of the 
decision rule to be used for detection by means of deci- 
sion feedback. 
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